NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Estimating Enzyme Expression and Metabolic Pathway Activity in Borreliella -Infected and Uninfected Mice

https://doi.org/10.1089/cmb.2024.0564

Rondel, Filipp Martin; Farooq, Hafsa; Hosseini, Roya; Juyal, Akshay; Knyazev, Sergey; Mangul, Serghei; Rogovskyy, Artem S; Zelikovsky, Alexander (June 2024, Journal of Computational Biology)

Full Text Available
Scalable Reconstruction of SARS-CoV-2 Phylogeny with Recurrent Mutations

https://doi.org/10.1089/cmb.2021.0306

Novikov, Daniel; Knyazev, Sergey; Grinshpon, Mark; Icer, Pelin; Skums, Pavel; Zelikovsky, Alex (November 2021, Journal of Computational Biology)

Full Text Available
From Alpha to Zeta: Identifying Variants and Subtypes of SARS-CoV-2 Via Clustering

https://doi.org/10.1089/cmb.2021.0302

Melnyk, Andrew; Mohebbi, Fatemeh; Knyazev, Sergey; Sahoo, Bikram; Hosseini, Roya; Skums, Pavel; Zelikovsky, Alex; Patterson, Murray (November 2021, Journal of Computational Biology)

Full Text Available
Leveraging genomic diversity for discovery in an electronic health record linked biobank: the UCLA ATLAS Community Health Initiative

https://doi.org/10.1186/s13073-022-01106-x

Johnson, Ruth; Ding, Yi; Venkateswaran, Vidhya; Bhattacharya, Arjun; Boulier, Kristin; Chiu, Alec; Knyazev, Sergey; Schwarz, Tommer; Freund, Malika; Zhan, Lingyu; et al (December 2022, Genome Medicine)

Abstract Background Large medical centers in urban areas, like Los Angeles, care for a diverse patient population and offer the potential to study the interplay between genetic ancestry and social determinants of health. Here, we explore the implications of genetic ancestry within the University of California, Los Angeles (UCLA) ATLAS Community Health Initiative—an ancestrally diverse biobank of genomic data linked with de-identified electronic health records (EHRs) of UCLA Health patients ( N =36,736). Methods We quantify the extensive continental and subcontinental genetic diversity within the ATLAS data through principal component analysis, identity-by-descent, and genetic admixture. We assess the relationship between genetically inferred ancestry (GIA) and >1500 EHR-derived phenotypes (phecodes). Finally, we demonstrate the utility of genetic data linked with EHR to perform ancestry-specific and multi-ancestry genome and phenome-wide scans across a broad set of disease phenotypes. Results We identify 5 continental-scale GIA clusters including European American (EA), African American (AA), Hispanic Latino American (HL), South Asian American (SAA) and East Asian American (EAA) individuals and 7 subcontinental GIA clusters within the EAA GIA corresponding to Chinese American, Vietnamese American, and Japanese American individuals. Although we broadly find that self-identified race/ethnicity (SIRE) is highly correlated with GIA, we still observe marked differences between the two, emphasizing that the populations defined by these two criteria are not analogous. We find a total of 259 significant associations between continental GIA and phecodes even after accounting for individuals’ SIRE, demonstrating that for some phenotypes, GIA provides information not already captured by SIRE. GWAS identifies significant associations for liver disease in the 22q13.31 locus across the HL and EAA GIA groups (HL p -value=2.32×10 −16 , EAA p -value=6.73×10 −11 ). A subsequent PheWAS at the top SNP reveals significant associations with neurologic and neoplastic phenotypes specifically within the HL GIA group. Conclusions Overall, our results explore the interplay between SIRE and GIA within a disease context and underscore the utility of studying the genomes of diverse individuals through biobank-scale genotyping linked with EHR-based phenotyping.
more » « less
Full Text Available
Accurate assembly of minority viral haplotypes from next-generation sequencing through efficient noise reduction

https://doi.org/10.1093/nar/gkab576

Knyazev, Sergey; Tsyvina, Viachaslau; Shankar, Anupama; Melnyk, Andrew; Artyomenko, Alexander; Malygina, Tatiana; Porozov, Yuri B; Campbell, Ellsworth M; Switzer, William M; Skums, Pavel; et al (July 2021, Nucleic Acids Research)

Abstract Rapidly evolving RNA viruses continuously produce minority haplotypes that can become dominant if they are drug-resistant or can better evade the immune system. Therefore, early detection and identification of minority viral haplotypes may help to promptly adjust the patient’s treatment plan preventing potential disease complications. Minority haplotypes can be identified using next-generation sequencing, but sequencing noise hinders accurate identification. The elimination of sequencing noise is a non-trivial task that still remains open. Here we propose CliqueSNV based on extracting pairs of statistically linked mutations from noisy reads. This effectively reduces sequencing noise and enables identifying minority haplotypes with the frequency below the sequencing error rate. We comparatively assess the performance of CliqueSNV using an in vitro mixture of nine haplotypes that were derived from the mutation profile of an existing HIV patient. We show that CliqueSNV can accurately assemble viral haplotypes with frequencies as low as 0.1% and maintains consistent performance across short and long bases sequencing platforms.
more » « less
Full Text Available
Technology dictates algorithms: recent developments in read alignment

https://doi.org/10.1186/s13059-021-02443-7

Alser, Mohammed; Rotman, Jeremy; Deshpande, Dhrithi; Taraszka, Kodi; Shi, Huwenbo; Baykal, Pelin Icer; Yang, Harry Taegyun; Xue, Victor; Knyazev, Sergey; Singer, Benjamin D.; et al (December 2021, Genome Biology)

Abstract Aligning sequencing reads onto a reference is an essential step of the majority of genomic analysis pipelines. Computational algorithms for read alignment have evolved in accordance with technological advances, leading to today’s diverse array of alignment methods. We provide a systematic survey of algorithmic foundations and methodologies across 107 alignment methods, for both short and long reads. We provide a rigorous experimental evaluation of 11 read aligners to demonstrate the effect of these underlying algorithms on speed and efficiency of read alignment. We discuss how general alignment algorithms have been tailored to the specific needs of various domains in biology.
more » « less
Full Text Available
Unlocking capacities of genomics for the COVID-19 response and future pandemics

https://doi.org/10.1038/s41592-022-01444-z

Knyazev, Sergey; Chhugani, Karishma; Sarwal, Varuni; Ayyala, Ram; Singh, Harman; Karthikeyan, Smruthi; Deshpande, Dhrithi; Baykal, Pelin Icer; Comarova, Zoia; Lu, Angela; et al (April 2022, Nature Methods)

Full Text Available
Benchmarking of computational error-correction methods for next-generation sequencing data

https://doi.org/10.1186/s13059-020-01988-3

Mitchell, Keith; Brito, Jaqueline J.; Mandric, Igor; Wu, Qiaozhen; Knyazev, Sergey; Chang, Sei; Martin, Lana S.; Karlsberg, Aaron; Gerasimov, Ekaterina; Littman, Russell; et al (December 2020, Genome Biology)

Full Text Available

Search for: All records